Deep multimodal fusion for semantic image segmentation: A survey

نویسندگان

چکیده

Recent advances in deep learning have shown excellent performance various scene understanding tasks. However, some complex environments or under challenging conditions, it is necessary to employ multiple modalities that provide complementary information on the same scene. A variety of studies demonstrated multimodal fusion for semantic image segmentation achieves significant improvement. These approaches take benefits sources and generate an optimal joint prediction automatically. This paper describes essential background concepts relevant applications computer vision. In particular, we a systematic survey methodologies, datasets, quantitative evaluations benchmark datasets. Existing methods are summarized according common taxonomy: early fusion, late hybrid fusion. Based their performance, analyze strengths weaknesses different strategies. Current challenges design choices discussed, aiming reader with comprehensive heuristic view segmentation.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards Robust Semantic Segmentation using Deep Fusion

Robust semantic scene understanding of unstructured environments is critical for robots operating in the real world. Several inherent natural factors such as shadows, glare and snow make this problem highly challenging, especially using RGB images. In this paper, we propose the use of multispectral and multimodal images to increase robustness of segmentation in real-world outdoor environments. ...

متن کامل

Robust Semantic Segmentation using Deep Fusion

متن کامل

Survey on Multimodal Medical Image Fusion Techniques

An Image fusion is the development of amalgamating two or more image of common characteristic to form a single image which acquires all the essential features of original image. Nowadays lots of work is going to be done on the field of image fusion and also used in various application such as medical imaging and multi spectra sensor image fusing etc. For fusing the image various techniques has ...

متن کامل

Season-Invariant Semantic Segmentation with a Deep Multimodal Network

Semantic scene understanding is a useful capability for autonomous vehicles operating in off-roads. While cameras are the most common sensor used for semantic classification, the performance of methods using camera imagery may suffer when there is significant variation between the train and testing sets caused by illumination, weather, and seasonal variations. On the other hand, 3D information ...

متن کامل

Multimodal medical image fusion based on Yager’s intuitionistic fuzzy sets

The objective of image fusion for medical images is to combine multiple images obtained from various sources into a single image suitable for better diagnosis. Most of the state-of-the-art image fusing technique is based on nonfuzzy sets, and the fused image so obtained lags with complementary information. Intuitionistic fuzzy sets (IFS) are determined to be more suitable for civilian, and medi...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Image and Vision Computing

سال: 2021

ISSN: ['0262-8856', '1872-8138']

DOI: https://doi.org/10.1016/j.imavis.2020.104042